A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
Authors
Abstract
MOTIVATION: In genetic studies, statistically significant variants often explain far less trait variance than 'sub-threshold' association signals. To size follow-up studies, researchers need accurate estimates of the 'true' effect size at each SNP, e.g. the true mean of odds ratios (ORs)/regression coefficients (RRs) or Z-score noncentralities. Naïve estimates of effect sizes incur winner's curse biases, which are reduced only by laborious winner's curse adjustments (WCAs). Given that Z-score estimates can, in theory, be translated to other scales, we propose a simple method to compute the WCA for Z-scores, i.e. their true means/noncentralities.
RESULTS: WCA shrinks Z-scores towards zero, while on the P-value scale multiple testing adjustment (MTA) shrinks P-values towards one, which corresponds to a Z-score of zero. Thus, WCA on the Z-score scale is a proxy for MTA on the P-value scale. Therefore, to estimate Z-score noncentralities for all SNPs in genome scans, we propose the FDR Inverse Quantile Transformation (FIQT). It (i) performs the simpler MTA of P-values using FDR and (ii) obtains noncentralities by back-transforming the MTA-adjusted P-values to the Z-score scale. Realistic simulations suggest that, compared with competing methods, FIQT is (i) more accurate and (ii) more computationally efficient by orders of magnitude. Practical application of FIQT to the Psychiatric Genetic Consortium schizophrenia cohort predicts a non-trivial fraction of sub-threshold signals that become significant in much larger supersamples.
CONCLUSIONS: FIQT is a simple, yet accurate, WCA method for Z-scores (and ORs/RRs, via simple transformations).
AVAILABILITY AND IMPLEMENTATION: A 10-line R function implementation is available at https://github.com/bacanusa/FIQT
CONTACT: [email protected]
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
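The two steps described in the abstract (FDR adjustment of P-values, then back-transformation to the Z-score scale) can be sketched in a few lines. The following is an illustrative Python sketch, not the authors' R function. The names `bh_adjust` and `fiqt_sketch`, the choice of Benjamini-Hochberg as the FDR procedure, the two-sided P-value convention, and the `min_p` floor that guards against numerical underflow are all assumptions made for this example.

```python
import math
from statistics import NormalDist


def bh_adjust(pvals):
    """Benjamini-Hochberg adjusted p-values, returned in input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def fiqt_sketch(z_scores, min_p=1e-300):
    """Shrink Z-scores towards zero via FDR-adjusted p-values (assumed recipe)."""
    std_normal = NormalDist()
    # Two-sided p-value: 2 * (1 - Phi(|z|)) == erfc(|z| / sqrt(2)).
    # min_p is a hypothetical floor that keeps the quantile step defined.
    pvals = [max(math.erfc(abs(z) / math.sqrt(2.0)), min_p) for z in z_scores]
    adj = bh_adjust(pvals)
    shrunk = []
    for z, p in zip(z_scores, adj):
        # Back-transform the adjusted p-value to a non-negative Z magnitude.
        magnitude = -std_normal.inv_cdf(p / 2.0)
        # Restore the sign of the original statistic.
        shrunk.append(math.copysign(magnitude, z))
    return shrunk


if __name__ == "__main__":
    z = [5.0, -3.0, 0.5, 0.1, -1.2]
    for original, estimate in zip(z, fiqt_sketch(z)):
        print(f"{original:+.2f} -> {estimate:+.3f}")
```

Because adjusted P-values are never smaller than the raw ones, each estimate has the same sign as its Z-score and a magnitude no larger than the original, which is the shrinkage towards zero described above.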
Similar resources
Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data
Recent heritability analyses have indicated that genome-wide association studies (GWAS) have the potential to improve genetic risk prediction for complex diseases based on polygenic risk score (PRS), a simple modelling technique that can be implemented using summary-level data from the discovery samples. We herein propose modifications to improve the performance of PRS. We introduce threshold-d...
Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication s...
The projack: a resampling approach to correct for ranking bias in high-throughput studies
The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the "winner's curse" in genetic association. We introduce the projack (for Prediction by Re...
FIQT: a simple, powerful method to accurately estimate effect sizes in genome scans
Genome scans, including both genome-wide association studies and deep sequencing, continue to discover a growing number of significant association signals for various traits. However, often variants meeting genome-wide significance criteria explain far less of the overall trait variance than “sub-threshold” association signals. To extract these sub-threshold signals, there is a need for methods...
Selection Bias, Demographic Effects, and Ability Effects in Common Value
We find clear demographic and ability effects on bidding in common value auctions: inexperienced women are much more susceptible to the winner's curse than men, controlling for SAT/ACT scores and college major, but they catch up quickly; economics and business majors substantially overbid relative to other majors; and those with superior SAT/ACT scores are much less susceptible to the winner's ...